KMID : 1022420150070040049
|
|
Phonetics and Speech Sciences 2015 Volume.7 No. 4 p.49 ~ p.55
|
|
Korean Semantic Similarity Measures for the Vector Space Models
|
|
Lee Young-In
Lee Hyun-Jung Koo Myoung-Wan Cho Sook-Whan
|
|
Abstract
|
|
|
It is argued in this paper that, in determining semantic similarity, Korean words should be recategorized with a focus on the semantic relation to ontology in light of cross-linguistic morphological variations. It is proposed, in particular, that Korean semantic similarity should be measured on three tracks, human judgements track, relatedness track, and cross-part-of-speech relations track. As demonstrated in Yang et al. (2015), GloVe, the unsupervised learning machine on semantic similarity, is applicable to Korean with its performance being compared with human judgement results. Based on this compatability, it was further thought that the model¡¯s performance might most likely vary with different kinds of specific relations in different languages. An attempt was made to analyze them in terms of two major Korean-specific categories involved in their lexical and cross-POS-relations. It is concluded that languages must be analyzed by varying methods so that semantic components across languages may allow varying semantic distance in the vector space models.
|
|
KEYWORD
|
|
semantic similarity in Korean, semantic relatedness, lexical relation, cross-POS-relations
|
|
FullTexts / Linksout information
|
|
|
|
Listed journal information
|
|
|
|